Midori Overvievv 


EE Microsoft 


Carry Microsoft across disruptive HVV trends 
* Multicore, heterogeneous processors 
* Intel, ARM 


Tackling "tough" problems 


* Security, correctness 


Pervasive blocking, interactivity and responsiveness 
Opportunistically looking at evolving legacy 
* Application model 
* State separation to enable cloud and roaming 
Leveraging the success of Windows 
* Desktop integration, host Windows apps 


* Loosely coupled systems for data center scenarios 


м 


How We Work 


Incubation ANN to ship quality with a strong "engineering culture" 
Design and code 
* Automated unit 


s for everything 
Test coverage 


il components 100 
+ Hygiene (PREFix, FXCop, etc.) and Performance Gates 


Lab available both on-demand and scheduled 


Team is all developers (no PMs or SDETs) 


Everyone codes: everyone loves to code 


Open атанан and code 


ed: feedback encouraged 


te architecture, coding, testing 


Apply the same culture for online services as the OS itself 


We're hiring! 


Midori OS Architecture 


Runs in HyperV or direct HW (x86, x64, ARM) Isolation in depth 


Small, policy-free microkernel (C++) for HW interaction + Softv ion between processes (type-safety) 


Domain kernel (C#) contains most policy: schedulers, * Hard 
memory, resource management 


lation between domains for untrusted 
code 


"No lies", no virtualization of resources (paging) Drawbridge for Windows compatibility 


Type safety, capability-based security * Host Windows apps in a "Library OS 
+ Most of the system * Drawbridge PAL transforms syscalls into Midori ABls 
+ All 3' party code, including device drivers as processes 


Domain 1 Domain 2 


Domain Services Midori App | Windows App 


EXCEL.EXE 
Library OS 
Drawbridge PAL 

es 


ABI 


Domain Kernel Domain Kernel 


Programming Model 


Concurrency for Eve 


"A programmer of ordinary skill writes code in a 
natural manner and achieves results that are safe 
and scalable in a fraction of the time compared to 
state-of-the-art concurrency models.” 


Development Platform 


Safety and Correctness By-Construction 
Correct, Secure 
- Resp e, High Performance, Scalable 


Midori се апа Aggressive Static Analysis 


+ Immutability, isolation, mode model (contracts 


CH Productivity with C++ Performance 


+ Eliminates whole classes of errors (corruptions, leaks 
Allocation-free approaches to common problems (e.g., sub-arrays, strings: 


Safe Сеһештепгу 


* Lightweight processe: concurrency “for free 
* No locks or threads: no race conditions, deadlocks, priority inversion: 

Best in Class s Developer Usability and Productivity 
+ Fast builds, clear separation betw een “App” and "OS". 


* Midori C# and Midori Development Kit (МОК) fully in ‘ated with Visual Studio 


Device Driver Model 


Each driver isolated in its ovvn process 
* No third party code in Ring 0 
+ All third party code is verifiably safe 
* Each driver runs as a process — failure is isolated 


Driver model based on a rich framework 


+ Same programming models as regular applications, no separate SDK and DDK 
+ Hardware resources are abstracted 
* Framework largely declarative; handles queuing and resource management 

* SATA driver is 60% smaller than Windows equivalent 

* Intel 82598/Linux: 29,163 lines; Intel 82599/Midori: 2,356 lines 


Driver support is complete enough to run on common hardware 
* 10GB NIC, SATA, USB 


* Devices: Tegra3, Dell 5x00 desktops, Bing server SKUs, X1 Carbon Touch laptop 
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are isolation between domains for untrusted 
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Domain Kernel Domain Kernel 


Device Driver Model 


Each driver isolated in its ovvn process 
* No third party code in Ring 0 
* All third party code is verifiably safe 
* Each driver runs as a process — failure is isolated 


Driver model based on a rich framework 


* Same programming models as regular applications, no separate SDK and DDK 
* Hardware resources are abstracted 
* Framework largely declarative; handles queuing and resource management 

* SATA driver is 6096 smaller than Windows equivalent 

* Intel 82598/Linux: 29,163 lines; Intel 82599/Midori: 2,356 lines 


Driver support is complete enough to run on common hardware 
* 10GB NIC, SATA, USB 


* Devices: Tegra3, Dell 5x00 desktops, Bing server SKUs, X1 Carbon Touch laptop 


Networking 


Complete client and server-side stack 
* TCP, HTTP, UDP 
* HTTPS on the client and server 
* Inter-process data transfer uses immutable data buffers 
* Uses formalism (Hierarchical State Machines) to manage complex concurrency 


Achieves performance requirements without sacrificing security 
* Separates the data and control paths 
* Inter-process data transfer uses immutable data buffers 


End to end "zero copy 
Buffers cacheable throughout the system, without risk 


* DMA Channel for efficient data transfer between applications, driver, and hardware 
* Can drive 10Gb NIC at wire spee d using a single CPU core 


y than Window 


Simpler architecture and more thar 


Network Security 
* Extensive fuzzing of various network protocol implementations on Midori 
* Not a single vulnerability that enables remote execution, validation of Midori's safety approach 


Case Study: SPECWeb99 Historical Data 


TCP Select 
10000 Hardware tso Receive — 
Weak Cache EE 
Software TSO - DMA enable web 
Channel server log 


RFS Update 


IPC 
Improvements 


Midori Development Kit (МОК) 


Developer experience is Visual Studio based 
* Reuse VS project system, as much of the IDE as possible 
* Some customization for build system and new tools (async debugging) 


Midori extends C£ with systems and safety features 
* Modern error model — exceptions and contracts 
* Isolation, immutability, and controlled side-effects 
* Lifetime analysis and stack allocation 
* 1* class support in Visual Studio: IntelliSense, statement completion 


System is packaged as a set of reference assemblies 
* Model is similar to .NET / Windows 8 
* Rich Framework, V1 targets Server and Cloud; Client soon afterwards 
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PROGRAM MODELS 


ProgramModel 


Assets 
Console 
Configuration 
Deployment 


FRAMEWORK 


Networking 


Drs 
Http 
Sockets 
WebSockets 


Security 


Credentials 
Cryptography 


BASE CLASS LIBRARIES 


Collections 
UsteT>, Мар 
StackeT>, QueuecT», 


WebServer 


ServerPages 


Storage 


Records 


Filesystem 


Devices 


SysternClock 


Text 


Encodings 
RegularExpressions 
Unicode 


Platform 


String, Char, int32, int64, 
Funce..>, Arten 


m 
Parsing 


Graphics 


Reactive 
Kreis 


Languages 


JavaScript 
MidoriCSharp. 


Concurrency 


Data Parallelism 


Mathematics 


Arithmetic 


Probabilities 


CompilerServices 
Remoting 


Runtime 


Testing 


Predicat 
UnitTests 


Multimedia 


Drivers 


жр 
DriverFramework 
ға 


Globalization 


DateTime 
Formatting 
Strings 


Diagnostics 


Eventing 
PerformanceCounters 


ll 


Concurrency Model 


А қ i Lightweight, isolated 

Process Parallelism i processes 
SpeechEngine engine = factory.CreateProcess<SpeechEngine>(..); oun В 
SpeechRecoResult reco = await engine.Recognize( 


Blocking” coroutines 


Task Parallelism { 
н .i Spawn a task, no unsafe 
immutable LanguageModel model s state capture 
isolated List<Phoneme> phonemes x .; Сн 
AsyncResult«RecoScore» scoreTask = 
async.Run([consume phonemes] () => { .i Ownership transfer of 
engine.Score(model, phonemes H mutable state 
Di 
j “4 Safe use of 
RecoScore finalScore = await scoreTask; immutable data { 


Data Parallelism (Implicit, Libraries, Declarative Models) 
// Example LINQ queries with side effects are banned 
var myQuery from x in xs where x.f++ select x; 


` Race condition rejected at 
compile-time 


Deep Immutability in Frameworks 


public immutable class MyComparator : AComparator 
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Scheduling and Resource Management 


Kernel, User Mode Schedulers ‘Asynchronous 
* Message passing, task and data parallelism 
* Cooperative scheduling: less preemption, long quanta, warm caches 


* Unified interrupt dispatch pathway 


“Non-blocking” user code 
+ Linked stacks and small coroutines 
* Begin life with a small stack, adaptively grow as needed 
+ Mostly CH await model — but efficient waiting, no state machines 


Domain Kernel 


System-Wide Visibility 
* Mechani: embedded into the platform 
* Policy evaluated at multiple levels, all the way up to application code 


sm 


* Not expressed using "priorities 


System understands all interactions between processes and domains 
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Case Study: Lync 


© Windows 


Applications explicitly manage the threads 
and the kernel scheduler 


Some threads are set manually to 
real-time values! 


Lync 
More than 90 threads of varying priorities 
(Some at priority 21!) 


Hard to get right 
Breaks down under stress 
Interacts with other programs poorly 


ol Midori 


No threads for applications to manage 


Resource management is done 
declaratively 


Midori Lync 
Uses 17 processes, all sharing a common 


scheduler reservation, the resource manager 
handles the rest 


Simple to get right 
Stable under stress 


Interacts vvell vvith other programs 
14 


Application Model 


Midori programs are comprised of many small processes 
* Each connected by an interface written in C£ 
* Standard asynchronous RPC — server stub, client proxy 


A proxy to a remote object is a “capability” 
* Used to send asynchronous messages 
* Transferrable across processes 
* Used to expose all system services 
* Used for all I/O (disk, network, graphics, etc.) 
* Not forgeable, all security is capability-based, no ambient principal 


By construction, applications are 
* Concurrent (able to do many things at once) 
* Responsive (tolerant of high latency operations) 
* High Throughput (able to speculate and pipeline) 


Message Passing in Four Steps 


1. Declare a service interface, marked [Eventual] 


2. Compile the IDL project (or skip, if single project) Midori Build 


3. Implement the generated server-side stub 


4. You're ready to use the service via its client-side proxy 


peechEngine engine 
chResult = engine. Recognize 
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Message Passing Performance 


Asynchronous model masks latency 
* Asynchronous masks latency 
* Pipelining reduces round-trips 
* Speculation absorbs latency 4 
* Flow control improves efficiency «Я 


Stream audio = 

SpeechEngine engine = mgr.GetEngine("en-US"), d 
SpeechResult result = engine.Recognize(audio) ; ә 
AsyncResult<string> text = result.RecognitionText; 
Console.writeLine(try await text); 
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Error Model 


Robust error model for systems programming 
* Fail fast discipline — if it's a bug, stop running code "now" 


Contracts declare pre- and post-conditions 
* Declarative, side-effect-free 
* Compiler analysis enlightened, optimizes 
* The 9596 case; most places where .NET employs exceptions 


Checked exceptions for "dynamic" / recoverable failures 
* By default, methods may not throw 
* Not possible to circumvent, allows compilers to optimize 
* A method can mark itself 'throws' to opt-in 


* Optionally, methods may restrict the set of exceptions thrown 
* Callers may only depend on this if annotated — no dependencies on unstated behavior 


Deterministic destruction for eager resource reclamation 
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Storage 


Designed for modern storage devices and application I/O patterns 
* SSD, not spindle 


Log Structured Transactional Record Store 
* Reliability and consistency of a database Domem 

ighly-scalable lock-free architecture 

ppend-only, space reclamation (GC) based on policy 


> г 


Update 
Intention 
Appended 
to Shared | 
Log i 


ң XRS/Meld 


Application friendly 

* Efficient snapshots and replication 
apability-based access control model 
eclarative record-based programming model 


gom 


Shared Reliable Log 


DC 


Opportunity for data center cost and povver savings 
* Shared distributed logs using custom hardware 
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Storage 


Designed for modern storage devices and application I/O patterns 
* SSD, not spindle 


Log Structured Transactional Record Store 
* Reliability and consistency of a database Domain 

ighly-scalable lock-free architecture 

ppend-only, space reclamation (GC) based on policy 


wE 


Update 
Intention 
Appended 
to Shared | 
Log i 


ң XRS/Meld 


Application friendly 

* Efficient snapshots and replication 
apability-based access control model 
eclarative record-based programming model 


gom 


Opportunity for data center cost and power savings 
* Shared distributed logs using custom hardware 
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Storage Schemas and Records 


ПЕС TES 
р Gah А 


Schema Provider 


Transactional Record Store 


File System Schema 


[ве 1 
public sealed partial class FileRe 
{ 

The key is composed of 


Depth is how deep the File 
Name refers to the enti 
organization allows 

[Recordkey(Ordinal = @)] 
readonly ushort m pathDepth; 


[Recordkey(Ordinal = 1)] 
readonly string m fullName; 


[Recordcontent] 
DateTime m_creationTime; 
[Recordcontent] 
ateTime m updateTime; 

Entire file content is stored in this field 

Tiny 512 bytes) are 

Small sizes («3k bytes) same segment 

arger written independently and may span segments 
[Recordcontent] 
treamSource m content; 


Cloud Data Center 


Next generation Cloud support, standard SXUs, 
standard deployment mechanisms 
+ Midori is designed for the next generation data center 
* Cloud-based code and data deployment 
* Running on Autopilot V9, V10, and V11 SKUs 
* Excellent teamwork with Autopilot team 


Full integration with Bing data center automati 
* Automated deployment, 


imageName Midori.retail.amd64 
servicing, and reinstallation 


* Midori Server looks like a Windows Server to Autopilot 


Best in class debugging and monitoring 
* Events and Perf Counters viewable in XTS 
* Disk-based crash dumps, diagnosability is built-in 
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Web Programming Model 


Midori Web Server 
* Full HTTP support, including streaming and chunking 
* Highly configurable, admission control, web apps and metabase 
* Powerful object model, ASP.NET Razor also available 


Rich Web Services stack 
* Declarative XML and JSON serialization 
* Unified programming model with message passing for program services 


[Eventual] 
[WebService] 
public interface ISpeechService 


[WebMethod(Path="query") ] 

Stream Recognize( 
[WebArgHeader(Namez"Content-Type")] string contentType, 
[WebArgQuery] string locale, 


WebSockets coming online as we speak 
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Case Study: Midori Speech Server 


Picked Speech Recognition as an excellent workload and partner 
important, to the company going forward (Xbox, Windows, Phone, TV) 


drives 2X usage 


TY, at least 5X 


mersive experienc 
rvatively, basic math 'ech usage v 10X in the next 


26 Challe anging to meet latency and quality requirements has 


year 


hit the CPU wall 


Started with existing C++ assets, architected a modern Speech Service 
* Highly concurrent, Streaming rather than Batch 
“ 370 KLOC of unsafe C++ became 115 KLOC of safe Midori C# 
* Server simplification, modernized code (collections, error model, GC) 


Successfully delivered direct customer value 
Midori Speech Server now taking 2096 live Windows Phone traffic; began on 12/12/2012 
* Improved customer experience, 4096 latency improvement 
* 3-4x better data center density and capacity 
* Able to deliver Deep Neural Networks (DNN), further improving quality of Speech Recognition 


Windows /C++ — ------- 


Manual Streaming 
Implementation 


Windows / C++ 


Manual Streaming 
Implementation 


Midori / C# 
class AudioQueue 


Stream m input; 


public throws awaits int Read(in Spancbyte> bytes, int readCount) 


requires readCount 
requires readCount 


return try await m input.Read(in bytes, 8, 


=e 


ytes.Length 


readCount) ; 


Case Study: Speech Server Performance 


At more than triple the load, Midori delivers customer-visible latency gains 
* Windows Phone beats latency target at 3x load; Xbox at 4x 


1 г 18 E в 1 п 35 
) Bar ом ^ ON м 
Midori 1 H 18 Di 28 1 H 18 ЕД 35 
Request: 031 254 507 732 748 0.49 439 8.68 128 16.04 
298 мз 495) nn 8669 399 nn o мп бз өз? 
19353 21368 234.28 606855 735.64) 582 32.24 5149 9429 


Windows 
based ч 
server does 
not finish 
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Case Study: Improving Speech Quality, DNN 


n on Midori 


We have successfully enabled Deep Neural Network (DNN) recogni 
+ MSR research to improve quality (Word Error Rate [WER]) 


Industry standard for “learning and recognition”, thanks to performance breakthroughs 


GMM WER DNN WER Improw 


еп 9544 29.24% 25.58% 12.5% 
en-US Dictation 3309 21.26% 19.12% 10.1% 


Performance on Midori is good enough to use in production — both latency and scaling 


Will flight en-US DNN in April (5% of traffic); it-IT and es-ES shortly thereafter 
Lower latency Excellent scaling 
Windows Midori Improved 
19353 21368 23428 
12142 13872 18313 
63% 65% 78% 


Ongoing joint development effort between Midori , MSR, and IPE 
* Speech engineers writing code and using Midori 
* Training is still difficult; Google trains on 10X data, distributed, gets WER of 12.396" 
+ Long-term partnership on distributed DNN and machine learning 


ttp//research.m ftcom/apps/pubs/defaultaspe 498 


Case Study: SPECWeb05 


Speech results consistent with other workloads 


34,800 9,000 
70,600 500 


47,000 


78,442 


Gb NIC 


and Mide әгі runs per benchmark rules 


SPEC.org, except for custom tunings that improves Windows performance 


pecWeb05 Performance.aspx 


Why Great Performance? 


Midori architecture; “hands off” approach to domain-specific aspects of benchmarks 


Excellent юе, immutability 
. y IO pathw 


a state (0.582CPI vs, 0.778CPI) 
Safe deiere no locks 
de used tricky (and costly) lock-free code 
Built-in scalable scheduler 
“ No priority inversions, longer quanta 
* Wind de used manual thread pool implementation, dangerously used priorities 
saw the same thing last year with Lync on Midori 


Safe, highly optimized managed code (Bartok, Phoenix) 
196 GC tim 
and ir 


nanual m agement, 


er-module optimizations 


25% more on its way 


* Profile guided optimizations (PGO), parallelism 
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Midori U 


Ride the momentum of HTML5 and Windows 8 UI 
* Same story for Web Apps and Rich UI Apps 
* Declarative specification, layout via CSS 
* Use existing assets whenever possible (WIX, JavaScript, D2D) 


Solve key issues with Windows applications 
* Unsusceptible to "screen bleaching" (aka, the STA problem) 


* Safe code and capability-based security 
* Impervious to Ul injection attacks, escalation of privilege 


Functional/Reactive core 
* Formal model for how changes are made and observed 
* Provides a clear, consistent timeline as the state model 
* Classically suffers from size/speed penalties due to many objects, dependencies 
* Midori FRP "up-sizes" behaviors — one tree is a behavior 
* Compact array representation to save memory; no GC pointers 
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Midori Browser 


Flagship Midori UI application 
+ Demonstrate the value of Midori's UI architecture 
veasonable subset" of HTML 5, CSS 3, and JavaScript 
+ 1Е10 JavaScript engine integrated with Midori runtime (object layout, stacks) 
* Competitive performance with Microsoft's flagship browsers and runtimes 


Weather Application "Port" 


Changed 88 out of 398 lines 
>” of HTML 


Specific Code 98% lines of JavaScript 
of code untouched Changed 6 out of 1,570 
lines of CSS 
Windows 


JavaScript Library 
(WinJS) 


Drawbridge 


A “virtualization” technology from MSR for 
hosting isolated Windows apps 
* Mostly unmodified Windows DLLs 


* Host OS (Midori) implements Drawbridge ABI 
ibrary OS 


Application 


* Drawbridge PAL hosts Library OS (NT) and a Drawbridge 
(Windows) application on Midori 


* Allows us to run Microsoft Office on Midori, for example m 
Midori 
Runtime 


Midori's AppCompat story for rich Windows apps 
rawbridge 
PAL 


Drawbridge Performance 


Midor run Dr Wbrid V ALIVE 
Physical Memory Usage with Rdp Physical Memory Usage with Rdp No UI 
450 450 
350 So 
БЕ e 37 9 Window 
150 i a Midori 
HN d 


Paint 


'owerPoint 


Physical Memory Usage without Rdp Startup Time with Rdp 


50 $ sx 
200 É An * Midori 
150 a Midor = 


Paint Wordpad Excel PowerPoint — Link Wordpad Excel 
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Midori Tools 


Unified compiler infrastructure for C# and C++ 


* Ahead-of-time compilation 
* Optimizer improvements accrue to both native and managed code 


* Take advantage of Midori execution model 
* Example: JavaScript engine's unified object layout, linked stacks 


Use existing tools (e.g. Visual Studio) whenever possible 


Target many architectures 
+ x86, x64, ARM Tegra3, MDIL 
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C# Integer Benchstone x64 


IT seconds NGEN seconds Feb-2013 seconds JIT/Feb-2013 — NGEN/Feb-2013 
benchi /02 (lower is better) (омегіз better) (lower is better) (higher is better her is better) 
Bqueens.c 4.79 4.85 5 096 087 
pue | эзз| 455 аз 118 сіз 
addarra2.c 1723 1722 1393) 124 124 
addarray.c 2145 2.44 1491| 144 144 
arrayl.c 8.42 8.43 7.86 1.07 1.07 
array2c 30.85 48,04 0.64 0.64 
benche.c 14.79 10.64) 1471.39 
binserch.c 10.01 7.68) 1.39 130 
bubsort.c 17.73 1382) 128 128 
bubsort2.c 14.59 10.82 128 135 
csieve.c | 11.38 9.15) 125 124 
fib.c 7.81 0.01) 934.00 781.00 
heapsort.c 1112] 118 143 
iniarray.c 2.66) 10.35 10.16 
logicarr.c | 2246 1681} 133 1.34 
midpoint.c 18.49 2102 0.88 0.88 
mulmtx.c 20.66 13.42 1.54 1.54 
indhrysto.c. 28.64 17.59) 157 1.63 
permutat.c 10.14 721) 113. 141 
ple 1095 8.6) 127 127 
ршіес 14,64 27| 114 145 
quicksrt.c 12.71 1127 114 133 
shellsrt.c 24.26 20.86) 1.17 116 
sq текс 17.75 947 187 187 
treesort.c 3.69 3.76) 1.02 0.98 
tree ins.c 9.16 9.29) 0.99 0.99 
хроз mtx.c 3401 30.13) 1.13 143 

14.08 1136 124 124 
13.83 8.40 166 165, 
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SPEC2K6 x64 /O2 vs. VS 


1KG3 seconds 
(lower is better) 


CPU2006 INT /O217 


astar 
bzip2 

кс 

gobmk 
h264ref 
hmmer 
libquantum 
mef 
omnetpp 
peribench 
ЕТІ 
xalancbmk 
Geomean 


CPU2006 FP /0217 


dealll 
Ibm. 

mile 
пата 
povray 
soplex 
sphim3 
Geomean 


806 € 
5548 


ma 


(lower is better) 


515.4 
449) 


Feb-2013 seconds 
(lower is better) 


814. 
8397 
4666 
6849 
8349, 
013 
605.8. 


LKG3 seconds Ғеһ-2013 seconds 
(lower is better] 


her is better) 


102 
102 
104 
107 
107 
0.79. 
104 
0.97 
104 
105 
104 
108 
102 


0.99 
0.99 
100 
102 
118. 
103 
102 
103 


Is better 


677732 
496964 
204784 
94143 
69112 
547996 
924626 


154147 


2157540. 


330,628 


lKG3/Feb-2013 LKG3 bytes (lower 


(higher is better) is better) 


573876 
87874 
375944 
934922 
385832 


299,614 


lKG3/Feb-2013 163 bytes (lower ^ Feb-2013 bytes 
(lower is better) 


97440 
100645 


231361 


514059 
195730 


69176. 
554201 
232910 


149747 


2144857 


328,980 


Feb-2013 bytes 
(lower is better) 


621763 


8928: 
164001 
362612 
919081 
374103 


197933 


301,156 


Feb-2013/LKG3 
(lower is better) 


104 


Feb-2013/LKG3 
(lower is better) 


108 
102 
105 
0.96 
0.98 
0.97 
0.97 


10, 
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Tools Improvements 


Optimization 
+ Profile-guided optimization 
* Improved runtime check elimination, driven by Speech 
* SSE intrinsics for managed code 
* Better interprocedural optimizations for managed code 


Midori programming and execution models 
* Efficient asynchronous code using linked stacks 
* Frozen static objects 
* Class initialization at process start-up 


Developer experience 
* Optimized code debugging to help diagnose production failures 


* Improved compiler throughput for debug builds 


Continued innovation over the next year (PGO, etc) 
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Summary: Great Team, Great Year 


Last year, we felt good about architecture and performance 
* Focus had been on architectural "hard" problems, performance 
* Competitive results on industry-standard client and server benchmarks 
* Demonstrated "real time" Lync on Midori, great responsiveness 
* Steve & Qi asked us to tackle a real production workload 


This year, Midori is delivering production Speech Recognition in Bing 
* Great safety, great performance 
* Immediate and very visible customer impact 
* Demonstrable capacity improvement, potential for cost savings 


Overall, a great validation of Midori's unique architecture 
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What's Next for Midori 


Continuing momentum into production this coming year 
* Finish V1 of the kernel and development platform 
* More performance, platform innovation on its way 
* Speech to 10096, geocontinental 
* Deep Neural Networks and Distributed Learning 
* Continue exploring partners and workloads in OSD and Azure 


All design docs and code accessible to Microsoft FTEs 
* http://midori/Midori Design Notes 


Reminder - We are hiring! 


Q&A 


